Divide-and-conquer based large-scale spectral clustering

Abstract

Spectral clustering is one of the most popular clustering methods. However, how to balance the efficiency and effectiveness of large-scale spectral clustering with limited computing resources has not been properly solved for a long time. In this paper, we propose a divide-and-conquer based large-scale spectral clustering method to strike a good balance between efficiency and effectiveness. In the proposed method, a divide-and-conquer based landmark selection algorithm and a novel approximate similarity matrix approach are designed to construct a sparse similarity matrix within low computational complexities. Then clustering results can be computed quickly through a bipartite graph partition process. The proposed method achieves a lower computational complexity than most existing large-scale spectral clustering methods. Experimental results on ten large-scale datasets have demonstrated the efficiency and effectiveness of the proposed method. The MATLAB code of the proposed method and the experimental datasets are available at https://github.com/Li-Hongmin/MyPaperWithCode.
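The pipeline named in the abstract (landmarks, a sparse point-to-landmark similarity matrix, then a bipartite graph partition) can be illustrated with a minimal Python sketch. It is not the authors' MATLAB implementation. The landmark step is a stand-in (plain k-means on a random subsample) rather than the paper's divide-and-conquer selection, and the function names, the Gaussian kernel, and all parameter defaults are assumptions made for illustration.

```python
import numpy as np


def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means with random initialization; returns (centers, labels)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Squared Euclidean distance from every point to every center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    # Final assignment against the final centers.
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return centers, d2.argmin(axis=1)


def landmark_spectral_clustering(X, n_clusters, n_landmarks=50, n_neighbors=5, seed=0):
    """Hypothetical helper: landmark-based bipartite spectral clustering sketch."""
    X = np.asarray(X, dtype=float)
    n = len(X)
    rng = np.random.default_rng(seed)

    # Step 1 (assumption): landmarks are k-means centers of a random subsample.
    # The paper uses its own divide-and-conquer selection instead.
    idx = rng.choice(n, size=min(n, 10 * n_landmarks), replace=False)
    landmarks, _ = kmeans(X[idx], n_landmarks, n_iter=5, seed=seed)

    # Step 2: n x p similarity matrix that keeps only the K nearest landmarks
    # of each point (Gaussian kernel is an assumption). Stored densely here
    # for brevity; a large-scale version would use a sparse matrix.
    d2 = ((X[:, None, :] - landmarks[None, :, :]) ** 2).sum(axis=2)
    nearest = np.argsort(d2, axis=1)[:, :n_neighbors]
    rows = np.arange(n)[:, None]
    sigma2 = d2[rows, nearest].mean() + 1e-12
    B = np.zeros_like(d2)
    B[rows, nearest] = np.exp(-d2[rows, nearest] / (2.0 * sigma2))

    # Step 3: bipartite graph partition via SVD of the degree-normalized matrix.
    d_rows = np.maximum(B.sum(axis=1), 1e-12)
    d_cols = B.sum(axis=0)
    d_cols[d_cols == 0] = 1.0  # landmarks that no point selected
    Bn = B / np.sqrt(d_rows)[:, None] / np.sqrt(d_cols)[None, :]
    U, _, _ = np.linalg.svd(Bn, full_matrices=False)

    # Row-normalize the top singular vectors and cluster them with k-means.
    emb = U[:, :n_clusters]
    emb = emb / np.maximum(np.linalg.norm(emb, axis=1, keepdims=True), 1e-12)
    _, labels = kmeans(emb, n_clusters, seed=seed)
    return labels
```

The point of the construction is that the SVD runs on an n x p matrix with p much smaller than n, instead of an eigendecomposition of the full n x n similarity matrix. Calling `landmark_spectral_clustering(X, n_clusters=2, n_landmarks=20)` on an array `X` of shape `(n, d)` returns one integer label per row.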

Similar resources

Spectral clustering for divide-and-conquer graph matching

We present a parallelized bijective graph matching algorithm that leverages seeds and is designed to match very large graphs. Our algorithm combines spectral graph embedding with existing state-of-the-art seeded graph matching procedures. We justify our approach by proving that modestly correlated, large stochastic block model random graphs are correctly matched utilizing very few seeds through...

Correlation clustering: divide and conquer

Correlation clustering is an NP-hard problem, so methods for solving it do not scale well. The contraction method and its improvement enable us to construct a divide-and-conquer algorithm, which could help us cluster larger sets. In this article we present the contraction method and compare the effectiveness of this new method and our old methods.

Reduced Complexity Divide and Conquer Algorithm for Large Scale TSPs

The Traveling Salesman Problem (TSP) is the problem of finding the shortest route that visits each given city exactly once and returns to the starting city. The problem is NP-hard, which makes finding the optimal route extremely impractical even for problems as small as 20 cities, since the number of permutations becomes too high. Many heuristic me...

Applying Divide and Conquer to Large Scale Pattern Recognition Tasks

Rather than presenting a specific trick, this paper aims at providing a methodology for large scale, real-world classification tasks involving thousands of classes and millions of training patterns. Such problems arise in speech recognition, handwriting recognition and speaker or writer identification, just to name a few. Given the typically very large number of classes to be distinguished, many a...

A Novel K means Clustering Algorithm for Large Datasets Based on Divide and Conquer Technique

In this paper we propose an efficient algorithm based on the divide-and-conquer technique for clustering large datasets. In our research work we apply the divide-and-conquer technique to partitions of the large datasets and use squared Euclidean distance to measure the similarity between data points. The datasets are partitioned according to the number of cluster...

Journal

Journal title: Neurocomputing

Year: 2022

ISSN: 0925-2312, 1872-8286

DOI: https://doi.org/10.1016/j.neucom.2022.06.006